A Proofs

Neural Information Processing Systems

First, we recall basic properties of convex conjugate functions that we rely on in our proofs. The latter condition holds, e.g., for strongly convex functions. We provide the hyperparameters of all experiments with Algorithm 1 in Table 3. We use the Adam optimizer with the default betas. For the Gaussian case, we use a single GTX 1080 Ti GPU.



Rectified Noise: A Generative Model Using Positive-incentive Noise

Gu, Zhenyu, Xu, Yanchen, Huang, Sida, Guo, Yubin, Zhang, Hongyuan

arXiv.org Artificial Intelligence

Rectified Flow (RF) has been widely used as an effective generative model. Although RF is primarily based on probability flow Ordinary Differential Equations (ODEs), recent studies have shown that injecting noise through reverse-time Stochastic Differential Equations (SDEs) for sampling can achieve superior generative performance. Inspired by Positive-incentive Noise (pi-noise), we propose an innovative generative algorithm to train pi-noise generators, namely Rectified Noise (RN), which improves generative performance by injecting pi-noise into the velocity field of pre-trained RF models. With the Rectified Noise pipeline, pre-trained RF models can be efficiently transformed into pi-noise generators. We validate Rectified Noise by conducting extensive experiments across various model architectures on different datasets. Notably, we find that: (1) RF models using Rectified Noise reduce FID from 10.16 to 9.05 on ImageNet-1k. (2) Pi-noise generator models achieve improved performance with only 0.39% additional training parameters.
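The core idea of perturbing a pre-trained velocity field during sampling can be sketched as follows. This is a minimal toy illustration, not the paper's method: the `velocity` and `noise_generator` functions are hypothetical stand-ins for a trained RF network and a learned pi-noise generator.

```python
import numpy as np

rng = np.random.default_rng(0)

def velocity(x, t):
    # Toy stand-in for a pre-trained rectified-flow velocity field;
    # a real model would be a neural network v_theta(x, t).
    return -x

def noise_generator(x, t):
    # Toy stand-in for a learned pi-noise generator; the paper trains
    # this module, here it is a small fixed Gaussian perturbation.
    return 0.05 * rng.standard_normal(x.shape)

def sample(x0, steps=100, inject_noise=True):
    # Euler integration of the flow from t=0 to t=1, optionally
    # adding pi-noise to the velocity field at each step.
    x, dt = x0.copy(), 1.0 / steps
    for i in range(steps):
        t = i * dt
        v = velocity(x, t)
        if inject_noise:
            v = v + noise_generator(x, t)
        x = x + v * dt
    return x

x0 = rng.standard_normal((4, 2))
print(sample(x0).shape)  # (4, 2)
```

The sampler's structure is unchanged from plain rectified flow; only the velocity is perturbed, which is why pre-trained models can be reused.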


Enhancing Fractional Gradient Descent with Learned Optimizers

Sobotka, Jan, Šimánek, Petr, Kordík, Pavel

arXiv.org Machine Learning

Fractional Gradient Descent (FGD) offers a novel and promising way to accelerate optimization by incorporating fractional calculus into machine learning. Although FGD has shown encouraging initial results across various optimization tasks, it faces significant challenges with convergence behavior and hyperparameter selection. Moreover, the impact of its hyperparameters is not fully understood, and scheduling them is particularly difficult in non-convex settings such as neural network training. To address these issues, we propose a novel approach called Learning to Optimize Caputo Fractional Gradient Descent (L2O-CFGD), which meta-learns how to dynamically tune the hyperparameters of Caputo FGD (CFGD). Our method's meta-learned schedule outperforms CFGD with static hyperparameters found through an extensive search and, in some tasks, achieves performance comparable to a fully black-box meta-learned optimizer. L2O-CFGD can thus serve as a powerful tool for researchers to identify high-performing hyperparameters and gain insights into how to leverage the history-dependence of the fractional differential in optimization.
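To make the CFGD update concrete, here is a sketch using one common first-order truncation of the Caputo fractional derivative; this is an illustrative assumption, not the paper's exact formulation, and the static alpha schedule below is precisely what L2O-CFGD would replace with a meta-learned one.

```python
import math
import numpy as np

def caputo_fgd_step(x, grad, c, alpha, lr):
    # One CFGD update using a common first-order truncation of the
    # Caputo fractional derivative with lower terminal c:
    #   D^alpha f(x) ~= f'(x) * |x - c|^(1 - alpha) / Gamma(2 - alpha)
    # Setting alpha = 1 recovers ordinary gradient descent.
    scale = np.abs(x - c) ** (1.0 - alpha) / math.gamma(2.0 - alpha)
    return x - lr * grad(x) * scale

# Minimize f(x) = x^2 with a fixed (alpha, lr) per step; in L2O-CFGD
# these hyperparameters would be produced by a meta-learned model.
grad = lambda x: 2.0 * x
x, c = np.array([2.0]), np.array([0.0])
for step in range(50):
    alpha = 0.9  # static here; dynamically scheduled in the paper
    x = caputo_fgd_step(x, grad, c, alpha, lr=0.1)
print(float(x[0]) < 0.1)  # True
```

The |x - c| factor is what makes the update history-dependent through the lower terminal c, the behavior the meta-learner exploits.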




Stay Focused: Problem Drift in Multi-Agent Debate

Becker, Jonas, Kaesberg, Lars Benedikt, Stephan, Andreas, Wahle, Jan Philip, Ruas, Terry, Gipp, Bela

arXiv.org Artificial Intelligence

Multi-agent debate - multiple instances of large language models discussing problems in turn-based interaction - has shown promise for solving knowledge and reasoning tasks. However, these methods show limitations, particularly when scaling them to longer reasoning chains. In this study, we unveil a new issue of multi-agent debate: discussions drift away from the initial problem over multiple turns. We define this phenomenon as problem drift and quantify its presence across ten tasks (i.e., three generative, three knowledge, three reasoning, and one instruction-following task). To identify the reasons for this issue, we perform a human study with eight experts on discussions suffering from problem drift, who find the most common issues are a lack of progress (35% of cases), low-quality feedback (26% of cases), and a lack of clarity (25% of cases). To systematically address the issue of problem drift, we propose DRIFTJudge, a method based on LLM-as-a-judge, to detect problem drift at test time. We further propose DRIFTPolicy, a method to mitigate 31% of problem drift cases. Our study can be seen as a first step toward understanding a key limitation of multi-agent debate, highlighting pathways for improving its effectiveness in the future.
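The test-time monitoring loop can be sketched as follows. The judge here is a deliberately simple word-overlap heuristic standing in for an LLM judge; DRIFTJudge itself prompts an LLM, and the function names and threshold below are illustrative assumptions.

```python
def toy_drift_judge(problem, turn, threshold=0.2):
    # Toy stand-in for an LLM-as-a-judge: flags a debate turn as
    # drifting when its word overlap with the original problem is low.
    p, t = set(problem.lower().split()), set(turn.lower().split())
    overlap = len(p & t) / max(len(p), 1)
    return overlap < threshold

def monitor_debate(problem, turns):
    # Returns indices of debate turns flagged as problem drift;
    # a mitigation policy could then intervene on those turns.
    return [i for i, turn in enumerate(turns) if toy_drift_judge(problem, turn)]

problem = "compute the shortest path between node a and node b"
turns = [
    "the shortest path between node a and node b uses dijkstra",
    "let us instead discuss general philosophy of graphs and life",
]
print(monitor_debate(problem, turns))  # [1]
```

The key design point is that detection runs per turn at test time, so an off-topic turn can be caught before the drift compounds over the remaining discussion.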


Feature-Specific Coefficients of Determination in Tree Ensembles

Jiang, Zhongli, Zhang, Dabao, Zhang, Min

arXiv.org Machine Learning

Tree ensemble methods provide promising predictions with models that are difficult to interpret. The recent introduction of Shapley values for individualized feature contributions, accompanied by several fast computing algorithms for predicted values, shows intriguing results. However, individualizing coefficients of determination, a.k.a. $R^2$, for each feature is challenged by the underlying quadratic losses, although these coefficients allow us to comparatively assess a single feature's contribution to tree ensembles. Here we propose an efficient algorithm, Q-SHAP, that reduces the computational complexity to polynomial time when calculating Shapley values related to quadratic losses. Our extensive simulation studies demonstrate that this approach not only enhances computational efficiency but also improves estimation accuracy of feature-specific coefficients of determination.
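The underlying decomposition can be illustrated with a brute-force Shapley computation over subset-restricted $R^2$ values. This exponential-time sketch on a linear model is only meant to show what "feature-specific $R^2$" means; Q-SHAP's contribution is computing the tree-ensemble analogue in polynomial time, and the helper names here are hypothetical.

```python
from itertools import combinations
import math
import numpy as np

def shapley_r2(d, value):
    # Brute-force Shapley decomposition of R^2 across d features,
    # where value(S) is the R^2 of a model restricted to subset S.
    # By the efficiency axiom, phi sums to value(all features).
    phi = np.zeros(d)
    for j in range(d):
        others = [f for f in range(d) if f != j]
        for k in range(len(others) + 1):
            for S in combinations(others, k):
                w = math.factorial(k) * math.factorial(d - k - 1) / math.factorial(d)
                phi[j] += w * (value(S + (j,)) - value(S))
    return phi

def linear_r2(X, y):
    # Subset value function: R^2 of an ordinary least-squares fit
    # using only the features in S (toy stand-in for a tree ensemble).
    def value(S):
        if not S:
            return 0.0
        Xs = X[:, list(S)]
        beta, *_ = np.linalg.lstsq(Xs, y, rcond=None)
        return 1.0 - (y - Xs @ beta).var() / y.var()
    return value

rng = np.random.default_rng(0)
X = rng.standard_normal((200, 3))
y = 2 * X[:, 0] + 0.5 * X[:, 1] + 0.1 * rng.standard_normal(200)
phi = shapley_r2(3, linear_r2(X, y))
print(np.isclose(phi.sum(), linear_r2(X, y)((0, 1, 2))))  # True
```

Because the per-feature scores sum exactly to the full model's $R^2$, they support a fair comparative assessment of single-feature contributions, which is the quantity Q-SHAP makes tractable for tree ensembles.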


Neural Network Compression for Reinforcement Learning Tasks

Ivanov, Dmitry A., Larionov, Denis A., Maslennikov, Oleg V., Voevodin, Vladimir V.

arXiv.org Artificial Intelligence

In the last decade, neural networks (NNs) have driven significant progress across various fields, notably in deep reinforcement learning, highlighted by studies like [1, 2, 3]. This progress has the potential to transform many areas, such as embedded devices, IoT, and robotics. Although modern deep learning models have demonstrated impressive gains in accuracy, their large sizes limit their practical use in many real-world applications [4]. These applications may impose requirements on energy consumption, inference latency, inference throughput, memory footprint, real-time inference, and hardware costs. Numerous studies have attempted to make neural networks more efficient.